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MODinED GREEN FLUORESCENT PROTEINS 
Background of the Invention 
5 This invention relates generally to the fields of biology and chemistry. 

More panicularly. the invention is directed to modified fluorescent proteins and to 
methods for the preparation and use thereof. 

This invention was made with Government support imder Grant No. 
NS27177, awarded by the National Institute of Health. The Government has 
10 certain rights in this invention. 

In biochemistry, molecular biology and medical diagnostics, it is often 
desirable to add a fluorescent label to a protein so that the protein can be easily 
tracked and quantified. The normal procedures for labeling requires that the 
protein be covalently reacted in vitro with fluorescent dyes, then repurified to 
15 remove excess dye and any damaged protein. If the labeled protein is to be used 
inside cells, it usually has to be microinjected; this is a difficult and time- 
consuming operation that cannot be performed on large numbers of cells. These 
problems may, however, be elimmated by joining a nucleotide sequence coding for 
the protein of interest with the sequence for a naturally fluorescent protein, then 
20 expressing the fusion protein. 

The green fluorescent protein (GPP) of the jellyfish Aequorea victoria is a 
remarkable protein with strong visible absorbance and fluorescence from a /?- 
hydroxybenzylideneimidazolone chromophore, which is generated by cyclization 
and oxidation of the protein's own Ser-Tyr-Gly sequence at positions 65 to 67. A 
25 cDNA sequence [SEQ ID N0:1] for one isotype of GPP has been reported 
[Prasher, D. C. et al.. Gene 111, 229-233 (1992)]; cloning of this cDNA has 
enabled GPP expression in different organisms. The finding that the expressed 
protein becomes fluorescent in cells from a wide variety of organisms [Chalfie, M. 
et al.. Science 263, 802-805 (1994)] makes GPP a powerful new tool in molecular 
30 and cell biology and indicates that the oxidative cyclization must be either 
spontaneous or dependent only on ubiquitous enzymes and reactants. 
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A major question in protein photophysics is how a single chromophore can 
give widely different spectra- depending on its local protein environment. This 
question has received the most attention with respect to the multiple colors of 
visual pigments based on retinal [Merbs. S. L. & Nathans, J. Science 258, 464-466 
. (1992)1, but is also important in GFP. The GFP from ^e^-orea and that of the sea 
pansy ReniM remforms share the same chromophore, yet Ae,uorea GFP has two 
absorbance peaks at 395 and 475 «n, whereas Renilla GFP has only a single 
absorbance peak at 498 mn. with about 5.5 fold greater monomer extmction 
coefficient than the major 395 mn peak of the Aequorea protein [Ward. W. W. m 
, Biolumnescence and CkenUumnescence (eds. DeUca. M.A. & McElroy, W.D.) 
235-242 (Academic Press. New York, 1981)]. The spectra of the isolated 
chromophore and denamred protein at neutral pH do not match the spectra of either 
native protein (Cody. C. W. et al., Biochemisvy 32. 1212-1218 (1993)]. 

For many practical applications, the spectrum of Renilla GFP would be 
5 preferable to that of Aequorea, because wavelength discrimination between 
different fluorophores and deteaion of resonance energy transfer are easier .f the 
component spectra are tall and narrow rather than low and broad. Furthermore, 
the longer wavelength exciution peak (475 mn) ofAe^juorea GFP is almost ideal 
for fluoi«cein filter sets and is resistant to photobleaching. but has lower 
20 amplimde than the shorter wavelength peak at 395 nm. which is more susceptible 
to photobleaching IChalf.e et al. (1994). supraY For all these reasons, it would 
Clearly be advantageous to convert the Aequorea GFP excitation spectrum to a 
single peak, and preferably at longer wavelengths. 

There is also a need in the art for proteins which fluoresce at differem 
25 wavelengths. Variants of fluorescem proteins with differem colors would also be 
very useful for simultaneous comparisons of multiple protein fates, developmental 
lineages, and gene expression levels. 

Accordingly, it is an object of the present invention to provide improved 
fluorescem proteins which do not suffer from the drawbacks of native Aequorea 
30 GFP. 
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Summary of the Invention 

In accordance with the present invention, it has been determined that 
particular modifications in the polypeptide sequence of an Aequorea wild-type GFP 
[SEQ ID N0:2] lead to formation of products having markedly different excitation 
5 and emission spectra from corresponding products derived from wild-type GFP. 
Visibly distinct colors and/or increased intensities of emission make these products 
useful in a wide variety of contexts, such as tracking of differential gene expression 
and protein localization. 



10 Brief Description of the Drawings 

The invention may be better understood with reference to the accompanying 
drawings, in which: 

Fig. 1 compares different versions of GFP by gel electrophoresis 
and Coomassie blue staining; 
15 Fig. 2 illustrates a proposed biosynthetic scheme for GFP; 

Figs. 3a and 3b illustrate the excitation and emission spectra of wild- 
type and a first group of mutant GFPs; 

Figs. 4a and 4b illustrate the excitation and emission spectra of wild- 
type and a second group of mutant GFPs; 
20 Fig. 5 illustrates the rate of fluorophore formation in the wild-type 

GFP and the Ser 65-*Thr mutant; 

Figs. 6a and 6b illustrate the behavior of wild-type GFP and the Ser 
65-*Thr mutant, respectively, upon progressive irradiation with ultraviolet 
light; and 

25 Fig. 7 ilhistrates fluorescence excitation and emission spectra of a 

third group of GFP mutants. 
Detailed Despnption 9f frg InventjQn 

GFP was expressed in £. coli under the control of a T7 promoter for 
quantitative analysis of the properties of the recombinant protein. Gel 
30 electrophoresis under denaturing conditions showed protein of the expected 
molecular weight (27 kDa) as a dominant band (Fig. 1), which could be quantified 
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staply by d.»tome«y of »i»ing witt, Ooon«s» blue. Soluble re«»nbu«.. 
GF^ proved <o bave Uto-ic.. sp«o. ». tt.= s«ne or «.n sHghUy more 
fluor««nc. per mole of pro^ta. as OFP purif»d from Aeguor^a ^aonc . sh„«>ng 
^ U« soluble pro^ta in £. cM u«i.r,oes folding ^ o.,d.«ve 

5 cyctotionwilhashigbanefnciencjrasinlhejeUynsl.. 

TT,. bacteria also contained inclusion bodies consisting of protetn 
Mlstinguisbable from jellyfisb or soluble recombinant protein on ^« 
<Fi, 1) However, this material was completely ™>n-fluorescent. lacked the 
V IsL absc^ance bands of the chro„»phore, and could no. be made fluo«s«nt 
,0 even when soWUzed and subjects, to protocols tba. renamre GFP IWard^ V. W . 
i Bokman. S. H.. «oc»emi„^ «35^540 (1982); Surpin, M. A. & Ward. 
W W FHa,0CMm. PHM. 49, Absnact. 25S (1989)]. Therefore, protem 
from inchtsion bodies seemed p«ma«e«ly unable «> generate the mten^ 
chromophore. An interesUng intem^iate stage in protein mamration could b^ 
,5 genemtedbygrowingthebaceriaanaerobically. The soluble protein agamlo^ 
tt„ same as GFP on denamring gels (Fig. 1) bu, was non-fluorescent. In thts case, 
fluorescence gradually developed after admission of air, even when fr«h protem 
syuth^is was blocked using puromycin and tetracycline. EvidenUy. the soluble 
non-fluorescem protein sy,«hesized ur-er anaerobic conditions was teady to 
20 become flu^escen. once amtospheric oxygen was readmined. The fluorescence 

per proteinmolecule approached itsfinalasymmotic value withasingle^xpot^ntul 
L course and a rate constant of 0.24 ± .06 hr' (at 22-C) measured e-ther m 
huac. cells with proteitHsynthe^s i,*ibi«>rs or in a lysate in whKh the soluble 
p„,ems ^ cofactors w«e a thousar- fold more dilute. Such pseudo-frst order 
25 Idnetics strongly suggest that no enryn»s or coftc«,rs arc necessary for the fma, 
Step of fluorophore formation in GFP. 

It has thus been determined that formation of the final fluorophore require 

««H nrnceeds in wild-tvpe protein with a lime constant of -4 
molecular oxygen and proceeds m wua lyv^v . , • „ 

K.rin nn This was independent of dilution, raiplymg that 
h at 22 °C and atmospheric pO,. mis was iuuch 

30 the oxidation does not require enzymes or cofactors. 
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A molecular interpretation is presented in Fig. 2. If the newly translated 
apoprotein (top left) evades precipitation into inclusion bodies, the amino group of 
Gly 67 might cyclize onto the carbonyl group of Ser 65 to form an imidazolidin-S- 
one, where the process would stop (top center) if is absent. The new N=C 
5 double bond would be expected to promote dehydrogenation to form a conjugated 
chromophore; imidazolidin-S-ones are indeed known to undergo autoxidative 
formation of double bonds at the 4-position [Kjaer, A. Acta Chem. Scand, 7, 
1030-1035 (1953); Kidwai, A. R. &Devasia, G. M. /. Org. Chem. 27, 4527-4531 
(1962)] » which is exactiy what is necessary to complete the fluorophore (upper 
10 right). The protonated and deprotonated species (upper and lower right) may be 
responsible for the 395 and 470-475 nm excitation peaks, respectively. The 
excited states of phenols are much more acidic than their ground states, so that 
emission would come only from a deprotonated species. 

The Aequorea GFP cDNA was subjected to random mutagenesis by 
15 hydroxy lamine treatment or polymerase chain reaction. Approximately six 
thousand bacterial colonies on agar plates were illuminated with alternating 395 and 
475 nm excitation and visually screened for altered excitation properties or 
emission colors. 

According to a first aspect of the present invention, modifications are 
20 provided which result in a shift in the ratio of the two excitations peaks of the 
product after oxidation and cyclization relative to the wild type. Three mutants 
were found with significant alterations in the ratio of the two main excitation peaks 
(Table I). The mutations were sequenced and recombined with die wild-type gene 
in different ways to eliminate neutral mutations and assign the fluorescence effects 
25 to single amino acid substitutions, except for H9 where two neighboring mutations 
have not yet been separated. They all lay in the C terminal part of the protein 
(Table I), remote in primary sequence from the chromophore formed from residues 
65-67. 

These and other modifications are defined herein with reference to the 
30 amino acid sequence [SEQ ID N0:2] encoded by die reported cDNA (SEQ ID 
N0:1]; the first amino acid identified is the one found at the indicated location in 
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the reported sequence, while the second indicates the substitution found in the 
modified form. The fluorescent product derived from a wUd-type or modified GFP 
polypeptide sequence is no longer strictiy speaking a simple polypeptide after 
oxidation and cyclization; however, reference is sometimes made for sake of 
5 simplicity herein to the polypeptide (e.g., "wild-type GFP" or "modified GFP") 
where what is intended would be obvious from die context. Compared with wild- 
type GFP. H9 (Ser 202-Phe. Thr 203-Ile) had increased fluorescence at 395 nm 
excitation; P9 (Ue 167-.Val) and Pll (lie 167-Thr) were more fluorescem at 475 
nm excitation. 

10 one possibility for these spectral perturbations in P9 and Pll is that the 

mutations at He 167 shift a positive charge slighfly closer to ttie phenolic group of 
tiie fluorophoie: tiiis should botii increase tiie percentage of phenolic anion, which 
is probably die species responsible for tiie 47(M75 mn excitation peak, and shift 
die emission peak hypsochromically. However, tiie hypotiiesized ionizable 
15 phenoUc group would have to be buried inside die protein at normal pH. because 
tiie ratio of 471 to 396 nm peaks in tiie mutants could not be ftirtiier affected by 
external pH until it was raised to 10. just below tiie ttireshold for denamration. 
The pH-sensitivity of wild-type GFP is similar [Ward, W. W. et al.. Photochem. 
Photobiol. 35, 803-808 (1982)]. 
20 According to anotiier aspect of tiie invention, a mutant P4 (Tyr 66-His) 

was identified which was excitable by ultraviolet and fluoresced bright blue in 
contrast to tiie green of wild type protein. The excitation and emission maxima 
were hypsochromically shifted by 14 and 60 mn respectively from tiu^se of wild- 
type GFP The mutated DNA was sequenced and found to contain five ammo acid 
25 substimtions. only one of which proved to be critical: replacemem of Tyr 66 in tiie 
center of tiie chromophore by His (corresponding to a change in tiie GFP cDNA 
sequence [SEQ ID N0:11 at 196-198 from TAT to CAT). 

The surprising tolerance for substitution at tiiis key residue prompted ftirtiier 
site-directed mutagenesis to Trp and Phe at tiiis position. Trp gave excitation and 
30 emission wavelengtiis intermediate between Tyr and His (Table I) but was only 
weakly fluorescent, perhaps due to inefficiency of folding or chromophore 
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formation due to steric considerations. Phe gave weak fluorescence with an 
excitation maximum at 358 nm and an emission maximum at 442 nm. 
Accordingly, pursuant to this aspect of the invention modified GFP proteins which 
fluoresce at different wavelengths (preferably, different by at least 10 nm and more 
5 preferably, by at least 50 nm) relative to the native protein are provided, for 
example, those wherein Tyr 66 is replaced by Phe, His or Trp. 

In a further embodiment of this aspect of the invention, a double mutant 
Y66H, Y145F was identified which had almost the same wavelengths as the single 
mutant Y66H but almost twice the brightness, due mainly to a higher quantum 
10 efficiency of fluorescence. The double mutant also developed its fluorescence 
during overnight growth, whereas the single mutant required several days. 

In accordance with further embodiments of this aspect of the invention, a 
first round of mutagenesis to increase the brightness of Y66W yielded 
M153TA^163A/N212K as additional substitutions. This mutant was subjected to 
15 another round of mutagenesis, resulting in two further sets, N146I and 
I123V/Y145H/H148R (Table II). The quanmm efficiency of these mutants is now 
comparable to wild-type GFP. The clustering of the substimtions in residues 145 
to 163 suggest that those residues lie relatively close to the chromophore and that 
reductions in the size of their side chains might be compensating for the larger size 
20 of tryptophan compared to tyrosine. 

Pursuant to yet another aspect of the present invention, modified GFP 
proteins are provided which provide substantially more intense fluorescence per 
molecule than the wild type protein. Modifications at Ser 65 to Ala. Leu, Cys, 
Val, He or Thr provide proteins with red-shifted and brighter spectra relative to the 
25 native protein. In particular, the Thr mutant (corresponding to a change in the 
GFP cDNA sequence [SEQ ID N0:1] at 193-195 from TCT to ACT) and Cys 
mutant (corresponding to a change in the GFP cDNA sequence [SEQ ID NO: 1] at 
193-195 from TCT to TGT) arc about six times brighter than wild type when 
excited at the preferred long-wavelength band above 450 nm. As a consequence, 
30 these modified proteins are superior to wild type proteins for practically all 
applications. Further, the brightness of these modified proteins matches the 
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brighmess reported in the literature for ReniUa GFP; thus, these proteins clearly 
obviate the objections to the dimness of Aeguorea GFP. In fact, it is speculated 
that the chromophores in these modified proteins may exhibit the optmimn 
brighmess which could be achieved with a general structure derived from the 
5 Aeguorea GW chromophor^. In particular, these mutations provide products 
exhibiting one or more of the following salient characteristics which distmguish 
them clearly over the corresponding product from a wild-type GFP: reduced 
efficiency of excitation by wavelengths between about 350 and 420 mn; enhanced 
excitation and emission efficiency when excited with wavelengths longer than about 
10 450 nm; increased resistance to light-induced shifts in the excitation spectrum; and 
faster kinetics of fluorophore generation. In contrast, mutations to Trp. Arg. Asn, 
Phe and Asp did not provide improved brightness. 

Mutagenesis of S65T to shift its wavelengths further to the red yielded 
M153A/K238E(Tablen) as the GFP variant witf, the longest-wavelengthexcitation 

15 maximum yet described. 504 nm v.. 490 nm for S65T. Surprisingly, the em.ss.on 
peak hardly changed (514 nm vs. 511 mn). so that the separation between the 
excitation and emission peaks (Stokes' shift) is extremely narrow, only 10 mn. TTus 
is one of the smallest values reported for any fluorophore in aqueous solution at 
room temperature. As in the Y66W series. M153 seems to be influential. It is 

20 doubtful that K238E is important, because this substitution has been found to be 

without effect in other mutants. 

AS would be readily apparent to those working in the field, to provide the 
desired fluoiescent protein it would not be necessary to include the entire sequence 
of GFP In particular, minor deletions at either end of the protein sequence are 
25 expected to have littie or no impact on the fluorescence spectrum of the protem. 
Therefore, by a mutant or wild-type GFP sequence for purposes of the present 
invention are contemplated not only the complete polypeptide and oligonucleotide 
sequences discussed herein, but also functionally-equivalem portions thereof (i.e.. 
ponions of the polypeptide sequences which exhibit the desired fluorescence 
30 properties and oligonucleotide sequences encoding these polypeptide sequences . 
For example, whereas the chromophore itself (position 65-67) is obviously crucial. 
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the locations of known neutral mutations suggest that amino acids 76-1 15 are less 
critical to the spectroscopic properties of the product. In addition, as would be 
immediately apparent to those working in the field, the use of various types of 
fusion sequences which lengthen the resultant protein and serve some functional 
5 purpose in the preparation or purification of the protein would also be routine and 
are contemplated as within the scope of the present invention. For example, it is 
common practice to add amino acid sequences including a polyhistidine tag to 
facilitate purification of the product proteins. As such fusions do not significantly 
alter the salient properties of the molecules comprising same, modified GFPs as 
10 described herein including such fusion sequences at either end thereof are also 
clearly contemplated as within the scope of the present invention. 

Similarly, in addition to the specific mutations disclosed herein, it is well 
understood by those working in the field that m many instances modifications in 
particular locations in the polypeptide sequence may have no effect upon the 
15 properties of the resultant polypeptide. Unlike the specific mutations described in 
detail herein, other mutations provide polypeptides which have properties 
essentially or substamially indistinguishable ft-om those of the specific polypeptides 
disclosed herein. For example, the following substitutions have been found to be 
neutral (i.e., have no significant impact on the properties of the product): Lys 
20 3-*Arg; Asp 76-*Gly; Phe 99^Ile; Asn 105-Ser; Glu 1 15-Val; Thr 225-*^Ser; and 
Lys 238-M31U. These equivalent polypeptides (and oligonucleotide sequences 
encoding these polypeptides) are also regarded as within the scope of the present 
invention. In general, the polypeptides and oligonucleotide sequences of the 
present invention (in addition to containing at least one of the specific mutations 
25 identified herein) will be at least about 85% homologous, more preferably at least 
about 90% homologous, and most preferably at least about 95% homologous, to 
the wild-type GFP described herein. Because of the significant difference in 
properties observed upon introduction of the specified modifications into a GFP 
sequence, the presence of the specified modifications relative to the corresponding 
30 reported sequence for wild-type GFP [SEQ ID NO: 2] are regarded as central to the 
invention. 
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The oligonucleotide sequences of the present invention are particularly 
useful in processes for labeUing polypeptides of interest, e.g.. by the construction 
of genes encoding fluorescent fusion proteins. Fluorescence labeling via gene 
ftasion is site-specific and eliminates the present need to purify and label protems 
5 wrn> and microinject them into ceUs. sequences encoding the modified GFPs 

of the present invention may be used for a wide variety of purposes as are well 
known to ftose working in the field. For example, the sequences may be 
employed as reporter genes for monitoring the expression of the sequence fused 
thereto; unlike other reporter genes, the ^quences require neither substrates nor 
10 cell disruption to evaluate whether expression has be achieved. Similarly, Uie 
sequences of the present invention may be used as a means to trace lineage of a 
gene fused thereto during the development of a cell or organism. Further, the 
sequences of the present invention may be used as a genetic maricer; cells or 
organisms labeled in this mamter can be selected by. e.g. . fluorescence-activated 
15 cell sorting. The sequences of the present invention may also be used as a 
fluorescent tag to monitor protein expression in vivo, or to encode donors or 
acceptors for fluorescence resonance energy transfer. Other uses for the sequences 
of the present invention would be readily apparem to those working in the field, 
as would appropriate techniques for fusing a gene of interest to an oligonucleotide 
20 sequence of the present invention in the proper reading frame and in a suitable 
expression vector so as to achieve expression of the combined sequence. 

The availability of several forms of GFP with such different spectral 
properties should facilitate two-color assessment of differential gene expression, 
developmental fate, or protein trafficking. For example, if one wanted to screen 
25 for a drug that is specific to activate expression of gene A but not gene B. one 
could fuse the cDNA for one color of GFP to the promoter region of gene A and 
fuse the CDNA for another color to the promoter region of gene B. Both 
constructs would be transfected into target cells and the candidate drugs could be 
assayed to determine if they stimulate fluorescence of the desired color, but not 
30 fluorescenceoftheundesiredcolor. Similarly, one could test for the simultaneous 



wo 96A23810 



PCT/US95/14692 



11 

expression of both A and B by searching for the presence of both colors 
simultaneously. 

As another example, to examine the precise temporal or spatial relationship 
between the generation or location of recombinant proteins X and Y within a cell 
5 or an organism, one could fiise genes for different colors of GFP to the genes for 
proteins X and Y, respectively. If desired, DNA sequences encoding flexible 
oligopeptide spacers could be included to allow the linked domains to function 
autonomously in a single construct. By examining the appearance of the two 
distinguishable colors of fluorescence in the very same cells or organisms, one 
10 could compare and contrast the generation or location of the proteins X and Y with 
much greater precision and less biological variability than if one had to compare 
two separate sets of cells or organisms, each containing just one color of GFP 
fused to either protein X or Y. Other examples of the usefulness of two colors 
would be obvious to those skilled in the art. 
15 The further mutations to brighten the Y66H and Y66W variants of GFP 

enhance the possibility of using two or three colors of fluorescent protein to track 
differential gene expression, protein localizations or cell fates. For example, 
mutants P4-3 {Y66H/Y145F), W7 (Y66W/N146I/M153T/V163A/N212K) and 
S65T can all be distinguished from each other. P4-3 is specifically detected by 
20 exciting at 290-370 nm and collecting emission at 420-460 nm. W7 is specifically 
detected by exciting at 410-457 nm and collecting emission at 465-495 nm. S65T 
is specifically detected by exciting at 483-493 nm and collecting emission at 
wavelengths greater than 510 nm. Bacteria carrying these three proteins are 
readily discriminated under a microscope using the above wavelength bandpass 
25 filters. 

The chromophore in GFP is well buried inside the rest of the protein, so 
much of the dimness of the original point mutants was presumably due to steric 
mismatch between the substituted amino acid and the cavity optimized for tyrosine. 
The location of the beneficial mutations implies that residues 145-163 are probably 
30 close to the chromophore. The M153A/S65T mutant has the longest wavelengths 
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15 



12 

and smallest Stokes' shift of any known fluorescent protein that does not use a 
cofactor. 

The invention may be better understood with reference to the accompanying 
examples, which are intended for purposes of illustration only and should not be 
construed as in any sense limiting the scope of the invention as defined by the 
claims appended hereto. 
Exam ple 1 

The coding region of GFP clone 10.1 [Prasher et al. (1992). supra] was 
amplified by PGR to create Ndel and BamHI sites at the 5' and 3' ends, 
respectively, and was cloned behind the T7 promoter of pGEMEX2 (Promega) 
replacing most of the T7 gene 10. The resulting plasmid was transformed into the 
strain JM109(DE3) (Promega Corp. . Madison. WI), and high level expression was 
achieved by growing the culwies at 24»C to saturation without induction by IPTG. 
To prepare soluble extracts. 1.5 ml cell suspension were collected, washed and 
resuspended in 150 m1 50 mM Tris/HCl. pH 8.0, 2 mM EDTA. Lysozyme and 
DNAse I were added to 0.2 mg/ml and 20 Mg/ml. respectively, and tiie samples 
were incubated on ice until lysis occurred (1-2 hours). The lysates were tiien 
clarified by centrifuging at 12,000 x g for 15 minutes. Inclusion bodies were 
obtained as described in the literamre [Sambrook, J. et al. in Molecular Cloning: 
A Laboratory Manual Vol. 2. 17.37-17.41 (Cold Spring Harbor Press. Cold Spring 

Hartwr. New York, 1989)]. 

AS illustrated in Fig. 1, soluble extracts of £. coli expressing GFP show a 
predominant band which is absent in extracts from control cells and has the same 
electrophoretic mobility as native GFP isolated from tiie jellyfish A. victona. 
25 Inclusion bodies of expressing cells consist mainly of non-fluorescent GFP which 
has tiie same mobility as soluble GFP. Non-fluorescent soluble GFP of 
anaerobically grown culmres is also a major band witii correct mobility. Soluble 
extracts of tiie mutated clones H9. P9, Pll and P4 again contain a dommam 

protein witii essentially tiie same molecular weight. 
30 Random mutagenesis of tiie GFP cDNA was done by increasing tiie error 

rate of tiie polymerase chain reaction witi, 0.1 mM MnCl„ 50 ^M dATP and 200 



20 
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of dGTP, dCTP, and dlTP [Muhlrad, D. et aL, Yeast 8, 79-82 (1992)]. The 
product was ligated into pGEMEX2 and subsequently transformed into 
JM109(DE3). Colonies on agar were visually screened for different emission 
colors and ratios of brightness when excited at 475 vs. 395 nm. 
5 Figs. 3a and 3b illustrate the excitation and emission spectra of wild-type 

and mutant GFPs. In Figs. 3a and 3b, wild-type; S202F.T203I; - - 

- I167T; Y66W; — • — • Y66H. Samples were soluble firactions from £. 

coli expressing the proteins at high level, except for Y66W. which was obtained 
in very low yield and measured on intact cells. Autofluorescence was negligible 

10 for all spectra except those of Y66W, whose excitation spectrum below 380 nm 
may be contaminated by autofluorescence. Excitation and emission spectra were 
measured with 1.8 nm band widths and the non-scanning wavelength set to the 
appropriate peak. Excitation spectra were corrected with a rhodamine B quantum 
counter, while emission spectra (except for Y6(SW) were corrected for 

15 monochromator and detector efficiencies using manufacturer-supplied correction 
spectra. All amplitudes have been arbitrarily normalized to a maximum value of 
1.0. A comparison of brightness at equal protein concentrations is provided in 
Table I. 
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Variant 

Wild type 
H9 

P9 
-Pll 
P4 

W 
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labltl 

Characteristics of mutated vs. wild-Qrpe GFP 

Excitation Emission 
Maxima (mn)* Maxima (nm)^ 



Mutation 



none 

Ser 202-*Phe, 
Thr 203-»Ile 

ne 167-Val 

ne 167-»Thr- 

Tyr 6fr*His 



396 (476) 
398 

471 (396) 
471 (396) 
382 



Tyr 66-Trp 458 



508 (503) 
511 

502(507) 
502 (507) 
448 

480 



Relative' 
Fluorescence 

(el0O%) 
117%"' 

166%' 
188%' 
57%' 

n.d. 



•Values in narentheses are lower-amplitude peaks. , , . 

^Sr^ vSu^were observed when exciting at the main excitation peak; values 
15 i^pShls wrre observed when Uluminating at the lower-amphtude excttatton 

amounts of protein were used based on densitometry of gels stained with 

i'ectra recorded at excitation 395 nm were compared. 
20 Son maxima of spectra recorded at excitation 475 mn were compared^ 

Tt^ctrum of P4 recorded at 378 mn excitation was integrated and 
compared tfthe integrated emission spectrum of wild type recorded at 475 mn 
Son; both excitation and emission characteristics were corrected. 

R yample 2 

25 Ohgonucleotide-directed mutagenesis at the codon for Ser-65 of GFP cDN A 

was performed by the literamre method IKunkel. T.A. (1985) Proc. Natl. Acad. 
Sci USA 82, 488] using the Muta-Gene Phagcmid in Vitro MuUgenesis Kit version 
2 commerelally available from Bio-Rad. Richmond. CA. The method employs 
a bacterial host strain deficiem for dUTPase (dut) and uracil-N-glycosylase (ung), 

30 which results in an occasional substitution of uracil for thymine in newly- 
synthesized DNA. When the uracil-containing DNA is used as a wild-type 
template for oligonucleotide-directed in vitro mutagenesis, the complementary 
(mutant) strand can be synthesized in the presence of deoxynucleotides, ligase and 
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polymerase using the mutagenic oligonucleotide to prime DNA synthesis; the 
Version 2 kit utilizes unmodified T7 DNA polymerase to synthesize the 
complementary strand. When the heteroduplex molecule is transformed into a host 
with an active uracil-N-glycosylase (which cleaves the bond between the uracil base 
5 and the ribose molecule, yielding an apyrimidic site), the uracil-containing wild- 
type strand is inactivated, resulting in an enrichment of the mutant strand. 

The coding region of GFP cDNA was cloned into the BamHI site of the 
phagemid pRSETs from Invitrogen (San Diego, CA). This construct was 
introduced into the dut, ung double mutant E. colt strain CJ236 provided with the 
10 Muta-Gene kit and superinfected with helper phage VCSM13 (Stratagene, La Jolla, 
CA) to produce phagemid particles with single-stranded DNA containing some 
uracils in place of thymine. The uracil-containing DNA was purified to serve as 
templates for in vitro synthesis of the second strands using the mutagenic 
nucleotides as primers. The DNA hybrids were transformed into the strain 
15 XLlblue (available from Stratagene), which has a functional uracil-N-glycosylase; 
this enzyme inactivates the parent wild-type DNA strand and selects for mutant 
clones. DNA of several colonies were isolated and checked for proper mutation 
by sequencing. 

To express the mutant proteins, the DNA constructs obtained by 
20 mutagenesis were transformed into E, colt strain BL21(DE3)LysS (Novagen, 
Madison, WI), which has a chromosomal copy of T7 polymerase to drive 
expression from the strong T7 promotor. At room temperature 3 ml cultures were 
grown to saturation (typically, overnight) without induction. Cells from 1 ml of 
culmre were collected, washed and finally resuspended in 100 ^1 of 50 mM Tris 
25 pH 8.0, 300 mM NaCl. The cells were then lysed by three cycles of 
freeze/thawing (liquid nitrogen/30** C water bath). The soluble fraction was 
obtained by pellctting cell debris and unbroken cells in a microftige. 

To facilitate purification of the recombinant proteins, the vector used fuses 
a histidine tag (6 consecutive His) to the N-terminus of the expressed proteins. 
30 The strong interaction between histidine hexamers and Ni^* ions permitted 
purification of the proteins by NI-NTA resin (available commercially from Qiagen, 
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Chats^orth. CA). Microcolumns (10 .1 bed volume) were loaded with 100 .1 
soluble extract (in 50 mM Tris pH 8.0. 300 mM NaCl). washed with 10 bed 
volumes of the same buffer and with 10 volumes of the buffer containing 20 mM 
imidazole, m recombinant proteins were then eluted with the same buffer 
5 containing 100 mM imidazole. 

AUquots of the purified mutant GFP proteins were run along w.th wild-type 
GFP on a denamring polyacrylamide gel. The gel was stained with Coomasste 
blue and the protein bands were quantified by scamiing on a densitometer. Based 
on these results, equal amounts of each version of protein were used to run 
10 fluorescence emission and excitation spectra. 

Figs 4a and 4b compare the excitation and emission spectra of wild-type 
and Ser 65 mutants. In Fig. 4a. — S65T; - - S65A; - - - S65C; - • - • 
wild-type (emission at 508 nm). In Fig. 4B. — S65T; - - S65A; - - - S65C; 

. . . wild-type (excitation at 395 nm); wild-type (excitation at 475 mn). 

15 Excitation and emission spectra were measured with 1.8 nm bandwidths and *e 
non-scamung wavelength set to the approprUte peak. As is apparent from Fig. 4b. 
all three mutants exhibited substantially higher intensity of emission relative to the 
wild-type protein. 

Fig 5 illustrates tlK rates ol fluorophore fomation in wild-type GFP ai«l 

20 in the Ser 65-Thr mutatt t. E. ccU exptessing either wild-type or mutant GFP «ere 

grown anaerobically. At time-O. each sample was exposed » air; funher growd, 

and protein synthesis w«c prevented hy trausfcrnug .he cells to nutri»t-free 

„.edium also containing sodi^n azlde as a metabolic it^ihitor. Fh.o.escer« was 

„,b^y monitored as a function of titne. For each culture, Ute f.uor«cence 

25 intensities are «tptessed as a fracdon of d« f™al fluorescence intensity obtamcd 

at t = 18 to 20 hours, after oxidation had proceeded to completion. From F.S. 5, 

i. is apparent that developmem of fluoresce-^ prc«eds much more tpckl, .n the 
mutam.ha«inwiId-t««GFP.evenaf.ernorma.l«tionoftheabso.u.ebnghmesses 

(Figs 4. and 4b). Thetefore. when .he developmei« of GFP fluorescence .s used 
30 as an'assay for promotor activation and gene expression, the mutam clearly gives 
a more rapid and faithful measure than wild-type protein. 
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Figs. 6a and 6b illustrate the behavior of wild-type GFP and the Ser 
65-*Thr mutant, respectively, upon progressive irradiation with ultraviolet light. 
Numbers indicate minutes of exposure to illumination at 280 nm; intensity was the 
same for both samples. Wild-type GFP (Fig. 6a) suffered photoisomerization, as 
5 shown by a major change in the shape of the excitation spectrum. Illumination 
with broad band (240-400 nm) UV caused quahtatively similar behavior but with 
less increase of amplitude in the 430-500 nm region of the spectrum. The 
photoisomerization was not reversible upon standing in the dark. This 
photoisomerization would clearly be undesirable for most uses of wild-type GFP, 
10 because the protein rapidly loses brighmess when excited at its main peak near 395 
nm. The mutant (Fig. 6b) showed no such photoisomerization or spectral shift. 

Example 3 

GFP cDNAs encoding for Tyr66-His (Y66H), Tyr66-Trp (Y66W), or 

15 Ser65-*Thr (S65T) were separately further mutagcnized by the polymerase chain 
reaction and transformed into £. coli for visual screening of colonies with unusual 
intensities or colors. Isolation, spectral characterization (Table II and Fig. 7), and 
DNA sequencing yielded several additional useful variants. 

Random mutagenesis of the gfjp cDNA was done by increasing the error 

20 rate of the PGR with 0.1 mM MnClj and unbalanced nucleotide concentrations. 
The GFP mutants S65T, Y66H and Y66W had been cloned into the BamHl site 
of the expression vector pRSETB (Invitrogen), which includes a T7 promoter and 
a polyhistidine tag. The GFP coding region (shown in bold) was flanked by the 
following 5' and 3' sequences: 5*-G GAT CCC CCC GCT GAA TTC ATG ... 

25 AAA TAA TAA GGA TCC-3'. The 5* primer for the mutagenic PGR was the T7 
primer matching the vector sequence; the 3' primer was 5'-GGT AAG CTT TTA 
TTT GTA TAG TTC ATG CAT GCC-3'. specific for the 3' end of GFP. creating 
a Hindni restriction site next to the stop codon. Amplification was over 25 cycles 
(1 min at 94* C. 1 min 52' C, 1 min 72' C) using the AmpliTaq polymerase from 

30 Perkin Elmer. Four separate reactions were run in which the concentration of a 
different nucleotide was lowered from 200 fiM to 50 ^M. The PGR products were 
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combined, diE««d wi* BanM »«. HtadDl »d l,ga«d » U>e pRSETB cu, wi* 
Ba«HI »«i HiBdffl. TlK ligadon muture was <lialy«d agate. waBr. dried and 
,„bse<p=nfly .nnisfonned m,o *e b=c»=rial strain BUI(DE3) by elecTopotation 
(50 ,1 eleorocompettn. cells in 0.1 em cuve«s, 1900 V. 200 obm, 25 ,F). 
5 colonies on agar «e,e usually screened for brighmess as p«viously de«:nb«l 
hcKin. The selecKd clones were sequenced wift UK Sequenase version 2.0 ta. 
from United Stales Biocbemical. 

C„,„„es with freshly .rmsfom«d cells wete grown at 37" C to an optical 
density of 0.8 a, 600 ntn, then i«l»«d with 0.4 mM isopropylthiogalactoside 
,0 overnight a. rootn tentpcrature. Cells were washed in PBS pH 7.4. resusp«.ded ,n 
50 Tris pH 8.0, 300 n*l Naa and lysed in a Flt^ch press. The polyh.st«iuK- 
tagged OFF protete were purifled from cleared lysates on nickeUheUte columns 
(Qiagen) using 100 mM imida«>le in the above buffer to elute the protem. 

Exciudon spectra were obtained by coUecting emission at the respeatve 
,5 p«* wavelengths and we„ corrected by a Rhodamine B quantum conmcr 
Emission spectra were likewise measured at the respective excitaUon peaks and 
were corrected using factors from the fluorometer manufaco^er (Spex Indusmes, 
Edison NJ). in cleavage experiments emisston spectra were recorded at exctunon 
368 nm. For measuring molar extinction coefficients, 20 to 30 ,B of protein were 
20 used in 1 ml of PBS pH 7.4. Quantum yields of wild-typ. QFP. S65T, and F4.1 
mutants were estUnated by comparison with fluorescein in 0.1 N NaOH as a 
sundard of quanntm yield 0.91 [ed. Miller, J.N., S.cn.icris In nuoresce.ce 
Si^crcme-ry (Chapman and Ml, New York, 1981)1. Mutants F4 and P4-3 were 
likewise compat^ to S-amino-acridine in water (quanhtm yield 0.98). W2 and W7 
25 were compared to both standards, whtehfommately gave concordam results. 

Fig 7 illustrates the flootescence excitation and emission spectra of 
diffen=nt GFP mutants. All spectra were normalted to a maximal value of 1. 
Each pair of excimion and emission specnum is depicted by a distinct line style. 
The fluotescence properues of dK obutined GFP mutams are reponed m 

30 Table II. 
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Tal) lg ^ 

Fluorescence properties of GFP mutants 

^ Qone Mutations Excitation Emission ExtincLCoeff. Quannun 

inax(nm) max (nm) (M'W) yield 

P4-3 Y66H 381 445 14,000 0J8 

10 Y145F 

W7 Y66W 433(453) 475(501) 18.000(17.100) 0.67 

N146I 
M153T 
V163A 

^5 N212K 

W2 Y66W 432(453) 480 10.000(9,600) 0.72 

I123V 
Y145H 

20 H148R 
M153T 
V163A 
N212K 

P4-1 S65T 504 (396) 514 14.500(8.600) 0.54 

25 M153A 
K238E 



30 
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SEQUENCE LISTING 



(1) CEMERAL IMFORMATIOl: 

<i) APPLICAUT: T»ieft, Roger Y. 

Heira, RoQer 

CU) TITU OF lUVENTICW: IKDIFIED OREEN FLUORESCENT PROTEINS 
(iii) NUMBER OF SEQUENCES: 2 

IS) lil F^9^^<« street, Suite 500 

(C) CITY: Los Angeles 
(0) STATE: Califomi* 

(E) COUNTRY: USA 

(F) ZIP: 90O12 

(V) COHPVnER READABLE FORH: 

(A) MEDIUM TYPE: Floppy disk 

fHi COMPUTER: IBM PC cocnpatfble 

?c\ OPERATING SYSTEM: PC-OOS/HS-DOS . 

[I] SrE: Patentln Release #U0, Version #1.25 

<vi> CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: SpitsU, John P. 

(B) REGISTRATION NUMBER: 29.215 

CC) REFERENCE/DOCKET NUMBER: 1279-178 

<ix) TELECOMMUNICATION INfOW^^iS*; 

(A) TELEPHONE: (213) 977-1001 

(B) TELEFAX: (213) 977-1003 



(2) INFORMATION FOR SEQ ID N0:1: 

<i) SEQUENCE CHARACTERISTICS: 

(A) length: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CONA 



(U) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 1..717 



<xi) SEQUENCE DESCRIPTION: SEQ ID K0:1: 

;s s: s z z z s s; k r.{ ;r. J~ 

20 25 
C=T CAA CGT CAT CCA ACA TAC COA AAA CTT ACC CTT AAA TTT ATT TC^ 

Cly Glu Civ Asp Ala Thr Tyr Gly lys teu in ,^ 

ACT ACT CGA AAA CTA CCT CTT CCA TCG CCA ACA CTT CTC ACT ACT TTC 



48 



96 



144 



192 
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Thr Thp Gly Lys Leu Pro V«l Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

TCT TAT OCT GTT CAA TCC TTT TCA AGA TAC CCA GAT CAT ATG AAA CCG 
Ser Tyr Gly Val Gin Cys Phe Scr Arg Tyr Pro Asp His Met lys Arg 
65 70 75 80 

CAT GAC TTT TTC AAG AGT GCC ATG CCC CAA GGT TAT GTA CAG GAA AGA 
His Asp Phe Phe Lys Ser Ala Net Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

ACT ATA TTT TTC AAA GAT GAC 6GC AAC TAC AAG ACA CGT GCT GAA GTC 
Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

AAG TTT GAA GGT CAT ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT ATT 
Lys Phe Gtu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA TTG GAA TAC AAC 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
130 135 140 

TAT AAC TCA CAC AAT GTA TAC ATC ATG CCA CAC AAA CAA AAG AAT GGA 

Tyr Asn Ser His Asn Vol Tyr lie Met Ala Asp Lys Gin Lys Asn Gly 
K5 150 155 160 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA GAT GGA AGC GTT 
lie Lys Val Asn Phe Lys lie Arg His Asn He Glu Asp Gly Ser Val 
165 170 175 

CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT GGC CCT 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
180 185 190 

GTC CTT HA CCA CAC AAC CAT TAC CT6 TCC ACA CAA TCT GCC CTT TCG 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT CTT GAG TH GTA 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
210 215 220 

ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA TAC AAA TA 
Thr Ale Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



(2) INFORMATIOH FOR 5EQ ID N0:2: 

(i) SEOUEMCE CHARACTERISTICS: 

(A) LENGTH: 238 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(if) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 
15 10 15 

Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cys 
35 40 45 

Thr Thr Gly Lys Leu Pro Vol Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
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65 



70 



75 



fiO 



Hi. ASP Phe Phe ly, S.r *l. H.t Pro Glu Ciy Tyr Val Cln Clu Ar, 

A<1 



Thr lie Phe Ph. Ly. *.p *.P Cly Asn Tyr Lys thr Arfl Al. Clu V.l 
100 

ty, Phe GIU Gly A.P Thr Leu V.l A,n Ar, Ue 6lu L^ Lys Cly He 

ASP Phe Ly. Olu ASP Gly *.n U. L.u Gly HI. Lys Leu Clu Tyr Asn 

130 

Tyr Asn Ser HU Asn V.l Tyr Ue «et Al. Asp Ly. Cln Ly. Asn Cly 
Z Lys val Asn Phe Lys lie Ar, His Ajn Ue C.u Asp Cly Ser v.l 

Gin Leu Al. ASP HU Tyr Gin Gin Asn Thr Pro Ue Gly Asp Cly Pro 

ISO 

^ * « Mill Tvr Leu Ser Thr Gin Ser Ala Leu Ser 
val Leu Leu Pro Asp Asn His lyr Leu «r 

195 *™ 
ty. ASP Pro Asn Glu Lys Arg Asp His Het V.l Leu Leu Clu Phe V.l 

210 , 215 

Thr Ala Ala Ciy Ue Thr His Cly Hat Asp Glu Leu Tyr Lys 
225 230 
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WHAT IS CLAIMED IS: 

1 . A composition of matter comprising a fluorescent product derived 
from a modified fonn of an Aequorea wild-type GFP polypeptide, characterized 

5 in that upon oxidation and cyclization of amino acid residues in the modified form 
corresponding to positions 65 to 67 of wild-type GFP polypeptide sequence [SEQ 
ID NO: 2] a product exhibiting a different excitation and/or emission spectrum from 
a corresponding product derived from the wild-type GFP polypeptide sequence is 
formed. 

10 

2. A composition according to claim 1, wherein the product exhibits 
an alteration in the ratio of two main excitation peaks relative to the corresponding 
product derived from wild-type GFP. 

15 3. A composition according to claim 2, wherein increased fluorescence 

is exhibited at a shorter-wavelength peak of the two main excitation peaks, 

4. A composition according to claim 3, wherein the modified form of 
the wild-type GFP sequence comprises a replacement of Ser at a position 

20 corresponding to position 202 in the wild-type GFP sequence by Phe and a 
replacement of Thr at a position corresponding to position 203 by He. 

5. A composition according to claim 2, wherein increased fluorescence 
is exhibited at a longer- wavelength peak of the two main excitation peaks. 

25 

6. A composition according to claim 5, wherein the modified form of 
the wild-type GFP sequence comprises a replacement of He at a position 
corresponding to position 167 of the wild-type GFP sequence by Val or Thr. 

30 7. A composition according to claim 5, wherein the modified form of 

the wild-type GFP sequence comprises a replacement of Ser at a position 
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corresponding to position 65 of the wild-type GFP sequence by Thr. a replacement 
of Met at position 153 with Ala, and a replacement of Lys at position 238 wtth 
Glu. 

5 8. A composition according to claim 1. wherein the product fluoresces 

at a shorter wavelength than the corresponding product derived from wild-type 
GFP. 

9 A composition according to claim 8. wherein the modified form of 
10 the wild-type GFP sequence comprises a replacement of Tyr at a position 

corresponding to position 66 of the wild-type GFP sequence by Phe. His or Trp. 

10 A composition according to claim 8, wherein the modified form of 
the wild-type GFP sequence comprises a replacement of Tyr at a position 

15 corresponding to position 66 of the wild-type GFP sequence by His and a 
replacement of Tyr at position 145 with Phe. 

11. A composition according to claim 8. wherein the modified form of 
the wild-type GFP sequence comprises a replacemem of Tyr at a position 
20 corresponding to position 66 of the wild-type GFP sequence by Trp. a replacement 
of Asn at position 146 by He. a replacement of Met at position 153 by Thr. a 
replacement of Val at position 163 by Ala. and a replacement of Asn at positron 
212 by Lys. 

25 12 A composition according to claim 8. wherein the modified form of 

the wild-type GFP sequence comprises a replacemem of Tyr at a position 
corresponding to position 66 of the wild-type GFP sequence by Trp, a replacemem 
of He at position 123 by Val, a replacemem of Tyr at position 145 by H.s. a 
replacement of His at position 148 by Arg. a replacement of Met at position 153 

30 by Thr, a replacemem of Val at position 163 by Ala. and a replacemem of Asn at 
position 212 by Lys. 



i 
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13. A composition according to claim 1, wherein the product exhibits 
enhanced emission relative to the corresponding product derived from wild-type 
GFP. 

5 14. A composition according to claim 13, wherein the modified form of 

the wild-type GFP sequence comprises a replacement of Ser at a position 
corresponding to position 65 of the wild-type GFP sequence by an amino acid 
selected from the group consisting of Ala, Cys, Thr, Leu, Val and He. 

10 15 . A composition according to claim 14, wherein the amino acid is Cys 

or Thr. 

16. A substantially pure oligonucleotide sequence encoding a modified 
form of an Aequorea wild-type GFP polypeptide sequence according to any one of 

15 claims 1-15, 

17. A method for monitoring expression of a gene encoding a 
polypeptide, comprising: 

forming a combined sequence comprising the gene and an 
20 oligonucleotide sequence according to claim 16, in which combined 

sequence both the gene and the oligonucleotide sequence are in the same 
reading frame; and 

observing for fluorescence characteristic of a product derived from 
a polypeptide sequence encoded by the oligonucleotide sequence, the 
25 fluorescence indicating expression of a fusion protein encoded by the 

combined sequence. 
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18 A method for simultaneously monitoring expression of a first gene 
and a second gene in a single cell, tissue or organism, the first gene encoding a 
polypeptide different from a polypeptide encoded by the second gene, said method 
comprising: 

5 forming a first combined sequence comprising the first gene and a 

f,rst oligonucleotide sequence according to claim 16. in which first 
combined sequence both the first gene and the first oligonucleotide 
sequence are in the same reading frame; 

forming a second combined sequence comprising the second gene 

10 and a second oligonucleotide sequence selected from the group consisting 

of an oligonucleotide sequence encoding a wild-type GFP and an 
oligonucleotide sequence according to claim 16 different from the first 
oligonucleotide sequence, in which second combined sequence both the 
second gene and the second oligonucleotide sequence are in the same 

15 reading frame; and 

observing for fluorescence characteristic of products derived from 
polypeptide sequences encoded by the first and second oligonucleotide 
sequences, the fluorescence indicating expression of fusion proteins encoded 
by the combined sequences. 

19 A method according to claim 18. wherein agents are screened for 
utility in activating expression of at least one of the first and second genes. 

20. A method according to claim 18. wherein locations of proteins 
25 expressed from the first and second genes are compared. 

21. A method according to claim 18. wherein temporal sequence of 
expression of the first and second genes is determined. 
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22. A method for monitoring transcription of a gene, comprising: 

forming a combined sequence comprising upstream regulatory 
elements of the gene and an oligonucleotide sequence according to claim 
16, in which combined sequence both the upstream regulatory elements and 
the oligonucleotide sequence are in the same reading frame; and 

measuring the fluorescence characteristics of a product derived from 
a polypeptide sequence encoded by the oligonculeotide sequence, the 
fluorescence indicating transcription of the oligonucleotide imder the control 
of the upstream regulatory elements of the gene. 



10 



20 



23. A method for simultaneously monitoring expression of a first gene 
and a second gene in a single cell, tissue or organism, the first gene encoding a 
polypeptide different from a polypeptide encoded by the second gene, said method 
comprising: 

15 forming a first combined sequence comprising upstream regulatory 

elements of the first gene and a first oligonucleotide sequence according to 
claim 16, in which first combined sequence both the upstream regulatory 
elements of the first gene and the first oligonucleotide sequence are in the 
same reading frame; 

forming a second combined sequence comprising the upstream 
regulatory elements of a second gene and a second oligonucleotide sequence 
selected from the group consisting of an oligonucleotide sequence encoding 
a wild-type GFP and an oligonucleotide sequence according to claim 16 
different from the first oligonucleotide sequence, in which second combined 
25 sequence both the upstream regulatory elements of the second gene and the 

second oligonucleotide sequence are in the same reading frame; and 

observing for fluorescences characteristic of products derived from 
polypeptide sequences encoded by the first and second oligonucleotide 
sequences, the fluorescences indicating transcription under the control of 
30 the respective upstream regulatory elements of the first and second genes. 



wo 96/23810 



1/7 



PCT/US95/14692 




FIG. 1 



SUBSTITUTE SHEET (RULE 26) 



wo 96/23810 



PCT/US95/14692 




wo 96/23810 



J/7 



PCT/US95/14692 



i 



N 



FIG. 5a 



100 



V 



Y66W y Y \ J167T 



S 0.75- ^x.^ 



0.50 



§ 0.25 



0.00 



\ ^ 




300 



FIG. 3b 



V S202F, 
\ T203I_^ 

I 



350 400 450 

WAVELENGTH (nm) 



500 



1.25 



"a 100 



I '''' 



^ 0.50- I 



i 0.25 



0.00 



/ 




^00 450 500 550 

WAVELENGTH (nm) 

SUBSTITUTE SHEET (RULE 26) 



wo 96/23810 



PCT^S95/14692 



4/7 

FIG. 4a 




WAVELENGTH, nm 
SUBSTITUTE SHEET (RULE 26) 



wo 96/23810 PCT/US95/14692 

5/7 



FIG. 5 




SUBSTITUTE SHEET (RULE 26) 



wo 96/23810 



PCT/US95yi4692 



e/7 



FIG. 6a 

^^+5 I 1 1 r— T T , , T , r 




Oe+0^ — ■ ' ■ ' ' r — i — I — I ' • 

325 350 375 400 425 450 475 500 525 

EXCITATION-WAVELENGTH (nm) 

FIG. 6b 




525 



EXCITATION-WAVELENGTH fnm) 
SUBSTITUTE SHEET (RULE 26) 



wo 96/23810 



PCT/US95/I4692 




NORMALIZED FLUORESCENCE 
SUBSTITUTE SHEET (RULE 26) 



INTERNATIONAL SEARCH REPORT 



Imenutional application No. 
PCT/US95/I4692 



I A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) :Pteai« See Extra Sheet. 

US CL :435/6; 530/330; 536/23^ 
I According to Intenmionai g>ateat Claiiification (IPC ) or to both national classification and IPC 

B. FIELDS SEARCHED 

Mtnimum documenUlton searched (cUssiBcation system followed by classiAcation synibols) 
U.S. : 435/6; 530/350; 536/23.5 

I Documentation searched other than minimum documentation to the extent that such documenu are included in the fields searched 



I Electronic daU base consulted during the international search (name of daU base and. where practicable, search terms used) 

APS, MEDLINE, BIOSIS. CAPLUS, SCISEARCH 
search terms; green fluorescent protein 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Ciution of document, with indication, where appropriate, of the relevant pasuges 



Relevant to claim No. 



P, X 

|P, Y 

|P, X 
IP. Y 



FEBS Letters, Vol. 367, No. 2, issued 26 June 1995, Ehrig 
et al., "Green fluorescent protein mutants with altered 
fluorescence excitation spectra", pages 163-166, see pages 
163-165. 



Proceedings of the National Acadenny of Sciences, Vol. 91, 
No. 26, issued 20 December 1994, Heim et al., "Wavelength 
mutations and posttranslational autoxidation of green 
fluorescent protein", pages 12501-12504, see entire 
document. 



1-3, 5, 8, IB- 
IS, 20, 22, 23 

4, 6, 7, 9-15, 
19, 21 

1-6, 8, 9, 13, 
16-18, 20, 22, 
23 

7, 10, 14, 15, 
19, 21 



[rx| Further documenu are listed in the continuation of Box C. Q See patent family annex. 



•A' 



SpeckJ GBte(onca of cttad documents: 

docimaiidcfiains <bc tcaaml tate of the tit wfaicfa » man mrm^tni 
to be pert of perticular rtkveace 

ceriicf doni m mi pwbliriied on or eftcr the intemetiottej faini date 

documoa which amy thnnr doufata oo prioriiy clura(«) or which 
cited 10 MteMieh the pubUceiMa d«to of wiathcf citKion or oth 
•paciel wi ii n o («e apocifled) 

docuoMBt referrias lo en onl dadoeura. im. eihtbitiea or otfiar 

docunempiAliihed prior to the intcfwiioeel rdi&t date but k 
(he pfionty dif * 



Itfcr documeatpuhliihed eftcr the istefmtioaal rilinf < 
dite end fiot io eosnid wiA (he eppUoetioo bm cited to 
priociple or theory taderfyi^ (he kivcatiaB 



donanest of pefticnkr rcleviDae: (he deined mvanioa 
coeeidcred novel or CBoool be 



I of penieutar rcleveaor. the cleioMd 
d lo involve en ravcothm Mcp when 
ooobiDod with ooc or aofc other euch docuacali. 
bciot obvioiM to e pom tkUkd b the on 



family 



I Date of the actusl completion of the international search 
17 JANUARY IW6 



Date of mailing of the intemattonal search report 



2 1 FEB 1996 



{ Name and mailing address of the ISA/US 
Commisnoner of Ptienu and Trsdemarts 
Box PCT 

Weshtngton. D.C. 20231 
I Facsimile No. a03) 305-3230 



Autboriadd ofHccr 



Telephone No. a03) 308-0196 



Form PCT/ISA/210 (second sheet)(JuIy 1992>« 



INTERNATIONAL SEARCH REPORT 

(Continution). DOCUMENTS CONSIDERED TO BE RELEVANT 



IntenutioMi application No. 



P.X 
P. Y 
P, Y 

P.X 
P. Y 

A 
Y 



Citation or docuin cnt. with indication, where appropriate, of the relev«.t pamtei 

Biorrechnology. Vol. 13, No. 2. issued Febmaiy 1995. Delagrave 
et al., -Red-shifted excitation mutants of the green nuorescent 
protein", pages 151-154, see pages 151-153. 

BioTechniques. Vol. 19. No. 4, issued October 1995, Kain et al.. 
-Green fluorescent protein as a reporter of gene expression and 
protein localization", pages 650-655. see entire document. 

Nature Vol. 373. No. 6516, issued 23 February 1995, Heimet 
al., "Improved green fluorescence", pages 663-664, sec entire 
document. 

Science, Vol. 263. issued 11 February 1994, Chalfie et al., 
•Green fluorescent protein as a marker for gene expression . 
pages 802-805, see entire document. 



Relevant to claim No. 

1, 2, 5, 17-23 



3, 4, 6-15 
1-23 



1.2,5, 8, 13-16 

3. 4, 6, 7. 9-12. 
17-23 



1-23 
ll-23 



Fonn PCT/ISA/210 (continuaUon of tecond ih«»HJuly 1992)« 



INTERNATIONAL SEARCH REPORT 



IntemAiional appUcAtion No. 
PCTAJS95/J4692 



A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 



C07H 21/04; C07K 14/435; C12Q 1/25, 1/68 



Fonn PCT/ISA/210 (cjurm shcei)(July 1992)* 



